A sample is said to be homogeneous if all its elements are drawn from
the same population, otherwise heterogeneous. It is self-evident that there is
no sense in estimating the parameters of an assumed distribution from
heterogeneous samples. Considering the fact that samples of fatigue test data are
frequently composed of elements drawn from two or even three populations, it is
an important rule, frequently violated, to start the statistical analysis of a given
sample by stating whether it is homogeneous or not.
FOREWORD


The research work reported herein was conducted by Prof.
Dr. Waloddi Weibull, Avenue d'Albigny, 9 bis, 74000 Annecy,
France, under USAF Contract No. F44620-73-C-0066. This contract,
which was initiated under Project No. 7351, "Metallic
Materials", Task 735106, "Behavior of Metals", was administered
by the European Office of Aerospace Research. The work
was monitored by the Metals and Ceramics Division, Air Force
Materials Laboratory, Air Force Systems Command, Wright-Patterson
Air Force Base, Ohio 45433, under the direction of Mr.
W. J. Trapp, AFML/LL.

This report covers work conducted during the period 15
December 1973 to 15 January 1974. The manuscript was submitted
by the author for publication in March 1974.
1. INTRODUCTION


A sample is said to be homogeneous if all its elements are drawn from the
same  population,  otherwise  heterogeneous.  It  is  self-evident  that  there  is  no 
sense  in  trying  to  estimate  the  parameters  of  an  assumed  distribution  function 
from  heterogeneous  samples.  Considering  the  fact  that  samples  of  fatigue  test 
data  are  quite  frequently  composed  of  elements  drawn  from  two  or  even  three 
different  populations,  it  is  an  important  rule,  frequently  violated,  to  start 
the  statistical  analysis  of  a given  set  of  test  data  by  stating  whether  the 
underlying  samples  are  homogeneous  or  not. 

The  chi-square  test  has  been  used  for  such  investigations,  but  this  test 
requires  very  large  samples.  The  present  test  is  quite  simple,  and  it  is  not 
restricted  with  regard  to  the  size  of  the  sample. 


2. THE PRINCIPLE OF THE TEST

Consider a sample of size N with its elements $x_1, \ldots, x_N$ rearranged in
ascending order of magnitude. If all these order statistics belong to the
same population, and the unknown parameters are estimated by use of a reliable
method, say, the maximum likelihood method, applied alternatively to the
complete sample and to (N - 1) of its elements, for instance, by excluding the
largest order statistic $x_N$, then no significant differences between the two
sets of estimates will be expected. If a significant difference appears, then
it may be concluded that the value $x_N$ does not belong to the same population
as the main part of the other (N - 1) order statistics.

Let $F(x, m, \theta, \mu)$ be the distribution function of the population and let
$f(x, m, \theta, \mu)$ be the density function of a single value drawn from it, where
$m$, $\theta$, and $\mu$ are parameters.

Following formulae presented by Sarhan and Greenberg (1), the density
function of the joint distribution of the first k order statistics, which
will be called the likelihood equation of the k-truncated sample and denoted
by $L_k$, is given by

$$ L_k = \frac{N!}{(N-k)!} \, \{1 - F(x_k)\}^{N-k} \, f(x_1) \cdots f(x_k) \tag{1} $$

which for k = N takes the form

$$ L_N = N! \, f(x_1) \cdots f(x_N) \tag{2} $$
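As an illustration of equations (1) and (2), the following minimal Python sketch evaluates $\ln L_k$ for any density and distribution function supplied by the caller. The function name `log_likelihood_truncated` and its arguments are hypothetical and are not taken from the report; the logarithm is used only to avoid numerical underflow.

```python
import math


def log_likelihood_truncated(x, k, pdf, cdf):
    """Return ln L_k of equation (1) for the k smallest of the N values in x.

    pdf and cdf are caller-supplied callables (an assumption of this sketch).
    """
    xs = sorted(x)
    n = len(xs)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= N")
    # ln(N! / (N - k)!) computed with the log-gamma function
    log_const = math.lgamma(n + 1) - math.lgamma(n - k + 1)
    # ln f(x_1) + ... + ln f(x_k)
    log_dens = sum(math.log(pdf(v)) for v in xs[:k])
    # (N - k) ln{1 - F(x_k)}; this term vanishes for k = N, giving equation (2)
    log_surv = (n - k) * math.log(1.0 - cdf(xs[k - 1])) if k < n else 0.0
    return log_const + log_dens + log_surv
```

For k = N the survival term drops out and the result is the logarithm of equation (2).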

The present test now consists in computing the particular sets of estimates
$(\hat{m}, \hat{\theta}, \hat{\mu})$ which maximize the likelihood equations $L_N$ and
$L_{N-1}$, possibly also $L_{N-2}$, etc., and stating whether there are significant
differences between the alternative sets of estimates. The term "significant"
will be defined in the sequel.

The  further  calculations  will  be  applied  to  the  Weibull  distribution. 

3.  APPLICATIONS  TO  THE  WEIBULL  DISTRIBUTION 


3.1  General  Formulas 

In this particular case, we have

$$ P = F(x, m, \theta, \mu) = 1 - e^{-(x - \mu)^m / \theta^m} \tag{3} $$

and

$$ f(x, m, \theta, \mu) = (m/\theta^m) \, (x - \mu)^{m-1} \, e^{-(x - \mu)^m / \theta^m} \tag{4} $$

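Introducing (4) into (1), we have

$$ L_k = \frac{N!}{(N-k)!} \left(\frac{m}{\theta^m}\right)^{k} \{(x_1 - \mu) \cdots (x_k - \mu)\}^{m-1} \, e^{-\{(x_1 - \mu)^m + \cdots + (N-k+1)(x_k - \mu)^m\}/\theta^m} \tag{5} $$

and

$$ \ln L_k = \ln \frac{N!}{(N-k)!} + k \ln m - k m \ln \theta + (m - 1)\{\ln(x_1 - \mu) + \cdots + \ln(x_k - \mu)\} - \{(x_1 - \mu)^m + \cdots + (N-k+1)(x_k - \mu)^m\}/\theta^m \tag{6} $$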

From the condition $\partial L_k / \partial \theta = 0$, we obtain

$$ \theta^m = \{(x_1 - \mu)^m + \cdots + (N-k+1)(x_k - \mu)^m\}/k \tag{7} $$

Introducing (7) into (5) we have, neglecting factors depending only
on N and k,

$$ L_k = \frac{m^k \{(x_1 - \mu) \cdots (x_k - \mu)\}^{m-1}}{\{(x_1 - \mu)^m + \cdots + (N-k+1)(x_k - \mu)^m\}^{k}} \tag{8} $$

which for k = N takes the form

$$ L_N = \frac{m^N \{(x_1 - \mu) \cdots (x_N - \mu)\}^{m-1}}{\{(x_1 - \mu)^m + \cdots + (x_N - \mu)^m\}^{N}} \tag{9} $$
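The reduced likelihood (8) and the scale estimate (7) can be sketched in Python as follows. This is only an illustration under the parametrization of equations (3) and (4); the function names `profile_loglik` and `theta_hat` are hypothetical, and the logarithm of (8) is returned rather than (8) itself.

```python
import math


def _weighted_power_sum(d, n, m):
    """(x_1-mu)^m + ... + (N-k+1)(x_k-mu)^m for the shifted values d."""
    k = len(d)
    weights = [1.0] * (k - 1) + [float(n - k + 1)]
    return sum(w * v ** m for w, v in zip(weights, d))


def profile_loglik(x, k, m, mu=0.0):
    """Logarithm of equation (8); for k = N this is the logarithm of (9)."""
    xs = sorted(x)
    n = len(xs)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= N")
    if m <= 0.0 or mu >= xs[0]:
        raise ValueError("requires m > 0 and mu < x_1")
    d = [v - mu for v in xs[:k]]
    s = _weighted_power_sum(d, n, m)
    return k * math.log(m) + (m - 1.0) * sum(math.log(v) for v in d) - k * math.log(s)


def theta_hat(x, k, m, mu=0.0):
    """Scale estimate from equation (7) for given m and mu."""
    xs = sorted(x)
    d = [v - mu for v in xs[:k]]
    return (_weighted_power_sum(d, len(xs), m) / k) ** (1.0 / m)
```

Because $\theta$ has been eliminated through (7), only $m$ and $\mu$ remain as free arguments.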


The unknown parameters $m$ and $\mu$ may be estimated by equating to
zero the partial derivatives of $L_k$, that is

$$ \partial L_k / \partial m = 0, \quad \partial L_k / \partial \mu = 0 \tag{10} $$

3.2  Criteria  for  Significant  Differences 
3.2.1 Parameter $\mu$ Known

Without loss of generality, we may put $\mu = 0$ and equation
(8) then takes the form

$$ L_k = \frac{m^k (x_1 \cdots x_k)^{m-1}}{\{x_1^m + \cdots + (N-k+1) x_k^m\}^{k}} \tag{11} $$

and

$$ \ln L_k = k \ln m + (m - 1)(\ln x_1 + \cdots + \ln x_k) - k \ln\{x_1^m + \cdots + (N-k+1) x_k^m\} \tag{12} $$


The likelihood $L_k$ will be maximized by the particular value $\hat{m}$ which
satisfies the equation

$$ \partial \ln L_k / \partial m = 0 \tag{13} $$

that is,
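A minimal sketch of solving condition (13) numerically is given here. Differentiating (12) with respect to $m$ gives $k/m + \sum \ln x_i - k \sum w_i x_i^m \ln x_i / \sum w_i x_i^m$, with $w_i = 1$ for $i < k$ and $w_k = N - k + 1$; whether this agrees term for term with the author's own arrangement of the equation is an assumption. The bisection solver and the names `score` and `estimate_m` are hypothetical and are not the author's program.

```python
import math


def score(m, logs, weights):
    """Derivative of (12) with respect to m, for ln x_1 <= ... <= ln x_k."""
    k = len(logs)
    top = logs[-1]  # ln x_k, factored out so that exp() cannot overflow
    t = [w * math.exp(m * (l - top)) for w, l in zip(weights, logs)]
    mean_log = sum(ti * l for ti, l in zip(t, logs)) / sum(t)
    return k / m + sum(logs) - k * mean_log


def estimate_m(x, k, tol=1e-10):
    """Root of the score for the k smallest of the N values in x, with mu = 0."""
    xs = sorted(x)
    n = len(xs)
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= N")
    if xs[0] <= 0.0 or xs[0] == xs[k - 1]:
        raise ValueError("values must be positive and not all equal")
    logs = [math.log(v) for v in xs[:k]]
    weights = [1.0] * (k - 1) + [float(n - k + 1)]
    # The score decreases in m; it is assumed positive at the tiny start value.
    lo, hi = 1e-6, 1.0
    while score(hi, logs, weights) > 0.0:
        lo, hi = hi, 2.0 * hi
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if score(mid, logs, weights) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Bisection is chosen because the score is monotone in $m$, so the bracketing step always isolates the single root.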
The sampling distribution of the estimate $\hat{m}$ may be determined by
generating a large number of random samples from the actual population and
computing the corresponding values of $\hat{m}$ for each of them by use of equation (14).
Observing that the probability P in equation (3) is uniformly distributed over
the interval (0,1), the elements of a sample from a Weibull population with
the parameters $(m_0, \theta_0)$ are obtained by taking a sample of random sampling
numbers $r_i(0,1)$ and computing $x_i$ from

$$ x_i = \theta_0 \, (-\ln(1 - r_i))^{1/m_0} \tag{15} $$
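Equation (15) is the inverse of the distribution function (3) with $\mu = 0$, and can be sketched in Python as below. The function name `weibull_sample` and the optional `rng` argument are hypothetical; Python's standard `random` module stands in for the table of random sampling numbers.

```python
import math
import random


def weibull_sample(n, m0, theta0, rng=None):
    """Draw n values x_i = theta0 * (-ln(1 - r_i))**(1/m0), as in equation (15)."""
    rng = rng or random.Random()
    sample = []
    for _ in range(n):
        r = rng.random()  # uniform on [0, 1)
        # log1p(-r) equals ln(1 - r) and stays accurate for very small r
        sample.append(theta0 * (-math.log1p(-r)) ** (1.0 / m0))
    return sample
```

Passing a seeded `random.Random` instance makes the generated samples reproducible.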


Introducing (15) into (14), the parameter $\theta_0$ disappears and $x_i^m$ will be
replaced by

$$ x_i^m = (-\ln(1 - r_i))^{m/m_0} \tag{16} $$

which implies that the sampling distribution of $\hat{m}/m_0$ is, for any given value
of k, parameter-free and thus uniquely given for each combination of N and k.

Denoting the estimate of $m$ for k = N by $\hat{m}_0$, for k = N - 1 by $\hat{m}_1$, for
k = N - 2 by $\hat{m}_2$, etc., it can be concluded that the quotients $\hat{m}_1/\hat{m}_0$,
$\hat{m}_2/\hat{m}_1$, etc., have sampling distributions which are independent of the
true population parameter $m_0$ and uniquely given for each sample size N.

From these distributions, quotients $\hat{m}_1/\hat{m}_0$, $\hat{m}_2/\hat{m}_1$, corresponding to
preassigned levels of significance, say, 2% or 5%, will be determined. The
hypothesis of homogeneity will be rejected for samples yielding larger quotients.
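The Monte Carlo determination of such a critical quotient can be sketched as follows for the quotient of the estimate at k = N - 1 to the estimate at k = N, with the location parameter taken as zero. This is a sketch only: the bisection solver, the function names, the number of replicates and the empirical-quantile rule are all assumptions, and the samples are drawn with $m_0 = \theta_0 = 1$ on the strength of the parameter-free property stated above.

```python
import math
import random


def estimate_m(xs, k, tol=1e-9):
    """ML estimate of m from the k smallest of the sorted values xs (mu = 0)."""
    n = len(xs)
    logs = [math.log(v) for v in xs[:k]]
    top = logs[-1]
    weights = [1.0] * (k - 1) + [float(n - k + 1)]

    def score(m):
        # derivative of ln L_k with respect to m, scaled to avoid overflow
        t = [w * math.exp(m * (l - top)) for w, l in zip(weights, logs)]
        mean_log = sum(ti * l for ti, l in zip(t, logs)) / sum(t)
        return k / m + sum(logs) - k * mean_log

    lo, hi = 1e-6, 1.0
    while score(hi) > 0.0:
        lo, hi = hi, 2.0 * hi
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if score(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def unit_weibull(rng):
    """One value from the Weibull population with m0 = theta0 = 1."""
    r = rng.random()
    while r == 0.0:  # avoid x = 0, whose logarithm is undefined
        r = rng.random()
    return -math.log1p(-r)


def quotient_critical_value(n, level=0.05, replicates=2000, seed=0):
    """Empirical upper quantile of the quotient (k = N-1 estimate)/(k = N estimate)."""
    rng = random.Random(seed)
    quotients = []
    for _ in range(replicates):
        xs = sorted(unit_weibull(rng) for _ in range(n))
        quotients.append(estimate_m(xs, n - 1) / estimate_m(xs, n))
    quotients.sort()
    index = min(replicates - 1, int(math.ceil((1.0 - level) * replicates)) - 1)
    return quotients[index]
```

A sample whose observed quotient exceeds the returned value would be rejected at the chosen level; the precision of that value depends on the number of replicates.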

3.2.2  All  Parameters  Unknown 

If none of the parameters is known, it has been found convenient to
estimate the parameters by computing the values of $L_k$ or $L_N$ in equations (8)
and (9) for an appropriate set of pairs $(m, \mu)$ and to select that particular
pair $(m, \mu)$ which yields the largest value of $L_k$ or $L_N$.
The computer program 6/73 has been written for computing $L_N$. Observing
that

$$ 0 < \mu < x_1 \tag{17} $$

this program computes the eleven values of $m$ which maximize $L_N$ for

$$ \mu = x_1 (i/10), \quad i = 0(1)10 \tag{18} $$

and corresponding values of $\log L_N$ and $\theta$.
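The grid procedure can be sketched in Python as below. This is not program 6/73, whose listing is not reproduced here: all function names are hypothetical and the inner maximization over $m$ uses a bisection on the derivative of the log-likelihood, which is an assumption. The endpoint $\mu = x_1$ (i = 10) makes $\ln(x_1 - \mu)$ undefined, so the sketch stops at i = 9; how the original program treats that endpoint is not stated.

```python
import math


def _weights(k, n):
    return [1.0] * (k - 1) + [float(n - k + 1)]


def fit_shape(d, n, tol=1e-10):
    """ML estimate of m for the k smallest shifted values d (sorted, > 0) out of n."""
    k = len(d)
    logs = [math.log(v) for v in d]
    top = logs[-1]

    def score(m):
        t = [w * math.exp(m * (l - top)) for w, l in zip(_weights(k, n), logs)]
        mean_log = sum(ti * l for ti, l in zip(t, logs)) / sum(t)
        return k / m + sum(logs) - k * mean_log

    lo, hi = 1e-6, 1.0
    while score(hi) > 0.0:
        lo, hi = hi, 2.0 * hi
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if score(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def log_sum(d, n, m):
    """ln{(x_1-mu)^m + ... + (N-k+1)(x_k-mu)^m}, computed stably."""
    logs = [math.log(v) for v in d]
    top = logs[-1]
    terms = (w * math.exp(m * (l - top)) for w, l in zip(_weights(len(d), n), logs))
    return m * top + math.log(sum(terms))


def profile_loglik(d, n, m):
    """Logarithm of equation (8) for shifted values d."""
    k = len(d)
    return k * math.log(m) + (m - 1.0) * sum(math.log(v) for v in d) - k * log_sum(d, n, m)


def grid_fit(x, k=None, steps=10):
    """Rows (mu, m, theta, log-likelihood) for mu = x_1 * i / steps, i = 0..steps-1."""
    xs = sorted(x)
    n = len(xs)
    k = n if k is None else k
    if xs[0] <= 0.0:
        raise ValueError("requires x_1 > 0, as in (17)")
    rows = []
    for i in range(steps):  # the endpoint mu = x_1 is left out
        mu = xs[0] * i / steps
        d = [v - mu for v in xs[:k]]
        m = fit_shape(d, n)
        theta = math.exp((log_sum(d, n, m) - math.log(k)) / m)  # equation (7)
        rows.append((mu, m, theta, profile_loglik(d, n, m)))
    best = max(rows, key=lambda row: row[3])
    return rows, best
```

The `k` argument allows the same table to be produced for truncated samples (k = N - 1, N - 2, etc.).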

The computing time of this procedure is about one second for sample size
N = 10 with the computer IBM 360, M/75.

This program will now be extended to produce the same set of data for
k = N - 1, N - 2, etc., and to present the eleven quotients $\hat{m}_1/\hat{m}_0$, etc. In
cases where all the quotients are larger than the assigned levels of
significance, the hypothesis of homogeneity will be rejected. If some of the
quotients have acceptable values, the decision depends on the location of these
values. If they are far from the place of maximum likelihood, they will
motivate a rejection. If not, the hypothesis may be accepted or examined by other
tests.